Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 633 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 49.6 KiB |
| Average record size in memory | 80.2 B |
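The average record size appears to be the total memory footprint divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes and using the rounded figures from the table (the variable names are illustrative):

```python
# Figures copied from the dataset statistics table above.
KIB = 1024                 # bytes per KiB
total_size_kib = 49.6      # "Total size in memory" (already rounded in the report)
n_observations = 633       # "Number of observations"

# Average bytes per record = total bytes / number of rows.
avg_record_bytes = total_size_kib * KIB / n_observations
print(f"{avg_record_bytes:.1f} B")
```

Because the total size is itself rounded to one decimal, the result only matches the reported value to roughly the same precision.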
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 3 |
| Emp Name has a high cardinality: 624 distinct values | High cardinality |
|---|---|
| Gross Load is highly correlated with Gross Load Adjusted | High correlation |
| Gross Load Adjusted is highly correlated with Gross Load | High correlation |
| Emp Name is uniformly distributed | Uniform |
| Emp Code has unique values | Unique |
| New Customers has 176 (27.8%) zeros | Zeros |
| New/Existing Customers' has 64 (10.1%) zeros | Zeros |
| Inc. Gross Sales has 34 (5.4%) zeros | Zeros |
| Gross Load has 56 (8.8%) zeros | Zeros |
| Gross Load Adjusted has 19 (3.0%) zeros | Zeros |
| Digital Activation Count - CF has 85 (13.4%) zeros | Zeros |
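The zero percentages in these warnings follow from the zero counts and the 633 observations reported in the dataset statistics. A minimal sketch that recomputes them (the dictionary and variable names are illustrative, not part of the report):

```python
# Zero counts as listed in the warnings; 633 observations per the dataset statistics.
N_OBSERVATIONS = 633
zero_counts = {
    "New Customers": 176,
    "New/Existing Customers'": 64,
    "Inc. Gross Sales": 34,
    "Gross Load": 56,
    "Gross Load Adjusted": 19,
    "Digital Activation Count - CF": 85,
}

# Share of zeros per variable, as a percentage rounded to one decimal place.
zero_pct = {
    name: round(100 * count / N_OBSERVATIONS, 1)
    for name, count in zero_counts.items()
}

for name, pct in zero_pct.items():
    print(f"{name}: {pct}%")
```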
Reproduction
| Analysis started | 2021-02-01 08:15:07.658284 |
|---|---|
| Analysis finished | 2021-02-01 08:15:13.293153 |
| Duration | 5.63 seconds |
| Software version | pandas-profiling v2.10.0 |
| Download configuration | config.yaml |
Emp Code
Numeric
| Distinct | 633 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6157.672986 |
|---|---|
| Minimum | 368 |
| Maximum | 8488 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | 368 |
|---|---|
| 5-th percentile | 873 |
| Q1 | 5988 |
| median | 6720 |
| Q3 | 8168 |
| 95-th percentile | 8440.4 |
| Maximum | 8488 |
| Range | 8120 |
| Interquartile range (IQR) | 2180 |
Descriptive statistics
| Standard deviation | 2544.795403 |
|---|---|
| Coefficient of variation (CV) | 0.4132722554 |
| Kurtosis | 0.2793336296 |
| Mean | 6157.672986 |
| Median Absolute Deviation (MAD) | 1348 |
| Skewness | -1.285544972 |
| Sum | 3897807 |
| Variance | 6475983.641 |
| Monotonicity | Not monotonic |
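Several of the figures above are related by simple formulas: the mean is the sum divided by the number of observations, the range is maximum minus minimum, the IQR is Q3 minus Q1, the variance is the squared standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A short sketch that recomputes them from the reported values (the variable names are illustrative):

```python
# Values copied from the quantile and descriptive statistics tables above.
n = 633
total = 3897807
minimum, q1, q3, maximum = 368, 5988, 8168, 8488
std = 2544.795403

mean = total / n                  # arithmetic mean
value_range = maximum - minimum   # range
iqr = q3 - q1                     # interquartile range
variance = std ** 2               # variance is the squared standard deviation
cv = std / mean                   # coefficient of variation

print(f"mean={mean:.6f} range={value_range} iqr={iqr}")
print(f"variance={variance:.3f} cv={cv:.10f}")
```

The same checks should apply to the other numeric variables in the report, up to the rounding of the displayed values.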
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 8191 | 1 | 0.2% |
| 684 | 1 | 0.2% |
| 8482 | 1 | 0.2% |
| 8478 | 1 | 0.2% |
| 8477 | 1 | 0.2% |
| 6233 | 1 | 0.2% |
| 8474 | 1 | 0.2% |
| 6424 | 1 | 0.2% |
| 8471 | 1 | 0.2% |
| 8469 | 1 | 0.2% |
| Other values (623) | 623 |
| Value | Count | Frequency (%) |
| 368 | 1 | |
| 509 | 1 | |
| 516 | 1 | |
| 552 | 1 | |
| 554 | 1 |
| Value | Count | Frequency (%) |
| 8488 | 1 | |
| 8484 | 1 | |
| 8482 | 1 | |
| 8478 | 1 | |
| 8477 | 1 |
Emp Name
Categorical
| Distinct | 624 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.1 KiB |
| Muhammad Usman | 3 |
|---|---|
| Ali Raza | 3 |
| Hassan Raza | 2 |
| Muhammad Ali | 2 |
| Muhammad Noman | 2 |
| Other values (619) |
Length
| Max length | 31 |
|---|---|
| Median length | 13 |
| Mean length | 13.92733017 |
| Min length | 5 |
Characters and Unicode
| Total characters | 8816 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
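As an illustration of how such character properties can be read, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how the report computes its figures.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category (e.g. Lu, Ll, Zs)."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values listed in this report.
counts = category_counts("Shahid Mustafa")
print(counts)
```

Here `Lu` is Uppercase Letter, `Ll` is Lowercase Letter, and `Zs` is Space Separator, which correspond to the three categories reported for this variable.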
Unique
| Unique | 617 |
|---|---|
| Unique (%) | 97.5% |
Sample
| 1st row | Shahid Mustafa |
|---|---|
| 2nd row | Hafiz Muhammad Omer Zia |
| 3rd row | Mohsin Arif |
| 4th row | Agha Umaid Raza |
| 5th row | Anum Tariq |
| Value | Count | Frequency (%) |
| Muhammad Usman | 3 | 0.5% |
| Ali Raza | 3 | 0.5% |
| Hassan Raza | 2 | 0.3% |
| Muhammad Ali | 2 | 0.3% |
| Muhammad Noman | 2 | 0.3% |
| Muhammad Junaid | 2 | 0.3% |
| Abdullah | 2 | 0.3% |
| Ahsan Saleem | 1 | 0.2% |
| Chauhdary Muhammad Younas | 1 | 0.2% |
| Umer Farooq | 1 | 0.2% |
| Other values (614) | 614 |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| muhammad | 105 | 7.2% |
| ali | 58 | 4.0% |
| khan | 39 | 2.7% |
| syed | 32 | 2.2% |
| ahmed | 23 | 1.6% |
| hussain | 16 | 1.1% |
| hassan | 15 | 1.0% |
| usman | 13 | 0.9% |
| iqbal | 12 | 0.8% |
| raza | 12 | 0.8% |
| Other values (606) | 1132 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1593 | |
| (space) | 825 | 9.4% |
| i | 581 | 6.6% |
| h | 547 | 6.2% |
| m | 516 | 5.9% |
| d | 395 | 4.5% |
| n | 374 | 4.2% |
| e | 367 | 4.2% |
| r | 351 | 4.0% |
| u | 300 | 3.4% |
| Other values (40) | 2967 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6538 | |
| Uppercase Letter | 1453 | 16.5% |
| Space Separator | 825 | 9.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 1593 | |
| i | 581 | 8.9% |
| h | 547 | 8.4% |
| m | 516 | 7.9% |
| d | 395 | 6.0% |
| n | 374 | 5.7% |
| e | 367 | 5.6% |
| r | 351 | 5.4% |
| u | 300 | 4.6% |
| s | 295 | 4.5% |
| Other values (15) | 1219 |
| Value | Count | Frequency (%) |
| A | 287 | |
| S | 221 | |
| M | 175 | |
| K | 93 | 6.4% |
| H | 88 | 6.1% |
| F | 72 | 5.0% |
| N | 71 | 4.9% |
| R | 64 | 4.4% |
| Z | 53 | 3.6% |
| U | 49 | 3.4% |
| Other values (14) | 280 |
| Value | Count | Frequency (%) |
| (space) | 825 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7991 | |
| Common | 825 | 9.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 1593 | |
| i | 581 | 7.3% |
| h | 547 | 6.8% |
| m | 516 | 6.5% |
| d | 395 | 4.9% |
| n | 374 | 4.7% |
| e | 367 | 4.6% |
| r | 351 | 4.4% |
| u | 300 | 3.8% |
| s | 295 | 3.7% |
| Other values (39) | 2672 |
| Value | Count | Frequency (%) |
| (space) | 825 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8816 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 1593 | |
| (space) | 825 | 9.4% |
| i | 581 | 6.6% |
| h | 547 | 6.2% |
| m | 516 | 5.9% |
| d | 395 | 4.5% |
| n | 374 | 4.2% |
| e | 367 | 4.2% |
| r | 351 | 4.0% |
| u | 300 | 3.4% |
| Other values (40) | 2967 |
Designation
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.1 KiB |
| Relationship Manager | |
|---|---|
| Assistant Sales Manager | |
| Sales Manager |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 19.64139021 |
| Min length | 13 |
Characters and Unicode
| Total characters | 12433 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sales Manager |
|---|---|
| 2nd row | Relationship Manager |
| 3rd row | Sales Manager |
| 4th row | Assistant Sales Manager |
| 5th row | Sales Manager |
| Value | Count | Frequency (%) |
| Relationship Manager | 302 | |
| Assistant Sales Manager | 209 | |
| Sales Manager | 122 |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| manager | 633 | |
| sales | 331 | |
| relationship | 302 | |
| assistant | 209 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2108 | |
| e | 1266 | |
| s | 1260 | |
| n | 1144 | |
| (space) | 842 | 6.8% |
| i | 813 | 6.5% |
| t | 720 | 5.8% |
| l | 633 | 5.1% |
| M | 633 | 5.1% |
| g | 633 | 5.1% |
| Other values (7) | 2381 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10116 | |
| Uppercase Letter | 1475 | 11.9% |
| Space Separator | 842 | 6.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 2108 | |
| e | 1266 | |
| s | 1260 | |
| n | 1144 | |
| i | 813 | 8.0% |
| t | 720 | 7.1% |
| l | 633 | 6.3% |
| g | 633 | 6.3% |
| r | 633 | 6.3% |
| o | 302 | 3.0% |
| Other values (2) | 604 | 6.0% |
| Value | Count | Frequency (%) |
| M | 633 | |
| S | 331 | |
| R | 302 | |
| A | 209 | 14.2% |
| Value | Count | Frequency (%) |
| (space) | 842 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11591 | |
| Common | 842 | 6.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 2108 | |
| e | 1266 | |
| s | 1260 | |
| n | 1144 | |
| i | 813 | 7.0% |
| t | 720 | 6.2% |
| l | 633 | 5.5% |
| M | 633 | 5.5% |
| g | 633 | 5.5% |
| r | 633 | 5.5% |
| Other values (6) | 1748 |
| Value | Count | Frequency (%) |
| (space) | 842 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12433 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 2108 | |
| e | 1266 | |
| s | 1260 | |
| n | 1144 | |
| (space) | 842 | 6.8% |
| i | 813 | 6.5% |
| t | 720 | 5.8% |
| l | 633 | 5.1% |
| M | 633 | 5.1% |
| g | 633 | 5.1% |
| Other values (7) | 2381 |
Region
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.1 KiB |
| Central | |
|---|---|
| South | |
| North | |
| KPK | |
| Multan |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.570300158 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3526 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Central |
|---|---|
| 2nd row | Central |
| 3rd row | North |
| 4th row | North |
| 5th row | North |
| Value | Count | Frequency (%) |
| Central | 217 | |
| South | 185 | |
| North | 118 | |
| KPK | 62 | 9.8% |
| Multan | 51 | 8.1% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| central | 217 | |
| south | 185 | |
| north | 118 | |
| kpk | 62 | 9.8% |
| multan | 51 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 571 | |
| r | 335 | |
| o | 303 | |
| h | 303 | |
| n | 268 | |
| a | 268 | |
| l | 268 | |
| u | 236 | |
| C | 217 | 6.2% |
| e | 217 | 6.2% |
| Other values (5) | 540 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2769 | |
| Uppercase Letter | 757 | 21.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 571 | |
| r | 335 | |
| o | 303 | |
| h | 303 | |
| n | 268 | |
| a | 268 | |
| l | 268 | |
| u | 236 | |
| e | 217 | 7.8% |
| Value | Count | Frequency (%) |
| C | 217 | |
| S | 185 | |
| K | 124 | |
| N | 118 | |
| P | 62 | 8.2% |
| M | 51 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3526 |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 571 | |
| r | 335 | |
| o | 303 | |
| h | 303 | |
| n | 268 | |
| a | 268 | |
| l | 268 | |
| u | 236 | |
| C | 217 | 6.2% |
| e | 217 | 6.2% |
| Other values (5) | 540 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3526 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 571 | |
| r | 335 | |
| o | 303 | |
| h | 303 | |
| n | 268 | |
| a | 268 | |
| l | 268 | |
| u | 236 | |
| C | 217 | 6.2% |
| e | 217 | 6.2% |
| Other values (5) | 540 |
New Customers
Numeric
| Distinct | 15 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.015797788 |
|---|---|
| Minimum | 0 |
| Maximum | 19 |
| Zeros | 176 |
| Zeros (%) | 27.8% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.263441573 |
|---|---|
| Coefficient of variation (CV) | 1.122851502 |
| Kurtosis | 7.796467086 |
| Mean | 2.015797788 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.119844251 |
| Sum | 1276 |
| Variance | 5.123167757 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=15)
| Value | Count | Frequency (%) |
| 0 | 176 | |
| 1 | 155 | |
| 2 | 114 | |
| 3 | 68 | 10.7% |
| 4 | 44 | 7.0% |
| 5 | 25 | 3.9% |
| 6 | 23 | 3.6% |
| 7 | 10 | 1.6% |
| 8 | 6 | 0.9% |
| 9 | 5 | 0.8% |
| Other values (5) | 7 | 1.1% |
| Value | Count | Frequency (%) |
| 0 | 176 | |
| 1 | 155 | |
| 2 | 114 | |
| 3 | 68 | 10.7% |
| 4 | 44 | 7.0% |
| Value | Count | Frequency (%) |
| 19 | 1 | 0.2% |
| 15 | 1 | 0.2% |
| 12 | 1 | 0.2% |
| 11 | 1 | 0.2% |
| 10 | 3 |
New/Existing Customers'
Numeric
| Distinct | 18 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.791469194 |
|---|---|
| Minimum | 0 |
| Maximum | 22 |
| Zeros | 64 |
| Zeros (%) | 10.1% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 11 |
| Maximum | 22 |
| Range | 22 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.285101085 |
|---|---|
| Coefficient of variation (CV) | 0.866445411 |
| Kurtosis | 3.7561154 |
| Mean | 3.791469194 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.579867222 |
| Sum | 2400 |
| Variance | 10.79188914 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) |
| 1 | 107 | |
| 2 | 96 | |
| 3 | 86 | |
| 4 | 74 | |
| 0 | 64 | |
| 5 | 62 | |
| 6 | 41 | 6.5% |
| 7 | 28 | 4.4% |
| 8 | 22 | 3.5% |
| 11 | 13 | 2.1% |
| Other values (8) | 40 | 6.3% |
| Value | Count | Frequency (%) |
| 0 | 64 | |
| 1 | 107 | |
| 2 | 96 | |
| 3 | 86 | |
| 4 | 74 |
| Value | Count | Frequency (%) |
| 22 | 1 | 0.2% |
| 20 | 1 | 0.2% |
| 19 | 2 | 0.3% |
| 14 | 5 | |
| 13 | 6 |
Inc. Gross Sales
Numeric
| Distinct | 565 |
|---|---|
| Distinct (%) | 89.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4045572.907 |
|---|---|
| Minimum | -903987 |
| Maximum | 46070446 |
| Zeros | 34 |
| Zeros (%) | 5.4% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | -903987 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 597505 |
| median | 2127080 |
| Q3 | 5002299 |
| 95-th percentile | 15352591.4 |
| Maximum | 46070446 |
| Range | 46974433 |
| Interquartile range (IQR) | 4404794 |
Descriptive statistics
| Standard deviation | 5803295.667 |
|---|---|
| Coefficient of variation (CV) | 1.434480555 |
| Kurtosis | 15.56383604 |
| Mean | 4045572.907 |
| Median Absolute Deviation (MAD) | 1827080 |
| Skewness | 3.359102973 |
| Sum | 2560847650 |
| Variance | 3.36782406 × 10^13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 34 | 5.4% |
| 300000 | 8 | 1.3% |
| 500000 | 4 | 0.6% |
| 50000 | 4 | 0.6% |
| 5000 | 4 | 0.6% |
| 1000000 | 3 | 0.5% |
| 400000 | 3 | 0.5% |
| 100000 | 3 | 0.5% |
| 1500000 | 3 | 0.5% |
| 1150000 | 2 | 0.3% |
| Other values (555) | 565 |
| Value | Count | Frequency (%) |
| -903987 | 1 | |
| -874599 | 1 | |
| -736159 | 1 | |
| -500000 | 1 | |
| -305395 | 1 |
| Value | Count | Frequency (%) |
| 46070446 | 1 | |
| 44419675 | 1 | |
| 42595993 | 1 | |
| 37370492 | 1 | |
| 35652813 | 1 |
Gross Load
Numeric
| Distinct | 542 |
|---|---|
| Distinct (%) | 85.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55141.20221 |
|---|---|
| Minimum | -65366 |
| Maximum | 1298416 |
| Zeros | 56 |
| Zeros (%) | 8.8% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | -65366 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4972 |
| median | 21429 |
| Q3 | 57043 |
| 95-th percentile | 224864.4 |
| Maximum | 1298416 |
| Range | 1363782 |
| Interquartile range (IQR) | 52071 |
Descriptive statistics
| Standard deviation | 112550.9647 |
|---|---|
| Coefficient of variation (CV) | 2.041140928 |
| Kurtosis | 48.0680877 |
| Mean | 55141.20221 |
| Median Absolute Deviation (MAD) | 19948 |
| Skewness | 5.868597777 |
| Sum | 34904381 |
| Variance | 1.266771965 × 10^10 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 56 | 8.8% |
| 989 | 5 | 0.8% |
| 494 | 5 | 0.8% |
| 49 | 4 | 0.6% |
| 4944 | 4 | 0.6% |
| 2966 | 4 | 0.6% |
| 1451 | 2 | 0.3% |
| 14508 | 2 | 0.3% |
| 4500 | 2 | 0.3% |
| 1780 | 2 | 0.3% |
| Other values (532) | 547 |
| Value | Count | Frequency (%) |
| -65366 | 1 | 0.2% |
| -7962 | 1 | 0.2% |
| 0 | 56 | |
| 49 | 4 | 0.6% |
| 50 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 1298416 | 1 | |
| 1193379 | 1 | |
| 770428 | 1 | |
| 769935 | 1 | |
| 694479 | 1 |
Gross Load Adjusted
Numeric
| Distinct | 572 |
|---|---|
| Distinct (%) | 90.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67400.28594 |
|---|---|
| Minimum | -52866 |
| Maximum | 1315916 |
| Zeros | 19 |
| Zeros (%) | 3.0% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | -52866 |
|---|---|
| 5-th percentile | 1470.2 |
| Q1 | 12521 |
| median | 34394 |
| Q3 | 73265 |
| 95-th percentile | 239250.6 |
| Maximum | 1315916 |
| Range | 1368782 |
| Interquartile range (IQR) | 60744 |
Descriptive statistics
| Standard deviation | 115270.0783 |
|---|---|
| Coefficient of variation (CV) | 1.710231294 |
| Kurtosis | 44.14405023 |
| Mean | 67400.28594 |
| Median Absolute Deviation (MAD) | 26845 |
| Skewness | 5.548201278 |
| Sum | 42664381 |
| Variance | 1.328719094 × 10^10 |
| Monotonicity | Decreasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 19 | 3.0% |
| 2500 | 18 | 2.8% |
| 5000 | 9 | 1.4% |
| 7500 | 7 | 1.1% |
| 15989 | 3 | 0.5% |
| 4944 | 3 | 0.5% |
| 20000 | 3 | 0.5% |
| 5466 | 3 | 0.5% |
| 7549 | 2 | 0.3% |
| 12360 | 2 | 0.3% |
| Other values (562) | 564 |
| Value | Count | Frequency (%) |
| -52866 | 1 | 0.2% |
| 0 | 19 | |
| 198 | 1 | 0.2% |
| 297 | 1 | 0.2% |
| 494 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 1315916 | 1 | |
| 1208379 | 1 | |
| 785428 | 1 | |
| 779935 | 1 | |
| 696979 | 1 |
Digital Activation Count - CF
Numeric
| Distinct | 33 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.903633491 |
|---|---|
| Minimum | 0 |
| Maximum | 49 |
| Zeros | 85 |
| Zeros (%) | 13.4% |
| Memory size | 5.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 15 |
| Maximum | 49 |
| Range | 49 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 5.82567452 |
|---|---|
| Coefficient of variation (CV) | 1.188032207 |
| Kurtosis | 13.58076572 |
| Mean | 4.903633491 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 3.017197276 |
| Sum | 3104 |
| Variance | 33.93848361 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=33)
| Value | Count | Frequency (%) |
| 1 | 91 | |
| 2 | 88 | |
| 0 | 85 | |
| 3 | 68 | |
| 4 | 60 | |
| 6 | 47 | |
| 5 | 46 | |
| 7 | 28 | 4.4% |
| 8 | 22 | 3.5% |
| 9 | 17 | 2.7% |
| Other values (23) | 81 |
| Value | Count | Frequency (%) |
| 0 | 85 | |
| 1 | 91 | |
| 2 | 88 | |
| 3 | 68 | |
| 4 | 60 |
| Value | Count | Frequency (%) |
| 49 | 1 | |
| 43 | 1 | |
| 40 | 1 | |
| 37 | 1 | |
| 36 | 1 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
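The calculation described above can be sketched in plain Python. This is a minimal illustration, not the report's own implementation; the function name `pearson_r` and the sample data are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y divided by the product of their standard deviations.

    Undefined (division by zero) when either input is constant.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations; the 1/(n-1) factors
    # of covariance and variance cancel between numerator and denominator.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


x = [1, 2, 3, 4, 5]
y_linear = [2 * v + 3 for v in x]      # change in location and scale of x
y_reversed = [-v for v in x]           # perfectly decreasing linear relation
print(pearson_r(x, y_linear), pearson_r(x, y_reversed))
```

The first call illustrates the invariance property: shifting and rescaling one variable leaves r unchanged.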
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
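A minimal sketch of this definition: rank both variables (assuming tied values share the average of their positions, a common convention), then apply the covariance-over-standard-deviations formula to the ranks. The helper names `average_ranks` and `spearman_rho` are hypothetical.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in rx))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in ry))
    return cov / (sd_x * sd_y)


x = [1, 2, 3, 4, 5]
y_cubic = [v ** 3 for v in x]  # nonlinear but strictly increasing
print(spearman_rho(x, y_cubic))
```

A strictly increasing nonlinear relation such as the cubic above yields identical ranks for both variables, so ρ reaches its maximum even though the relation is not linear.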
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
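The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch follows; the function name `kendall_tau_a` is hypothetical, and library implementations often use a tie-adjusted variant instead, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables move in the same direction
        elif product < 0:
            discordant += 1   # the variables move in opposite directions
        # Pairs tied in x or y count as neither concordant nor discordant.
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```

In this small example two of the three pairs are concordant and one is discordant, so τ = (2 - 1) / 3.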
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | Emp Code | Emp Name | Designation | Region | New Customers | New/Existing Customers' | Inc. Gross Sales | Gross Load | Gross Load Adjusted | Digital Activation Count - CF |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1097 | Shahid Mustafa | Sales Manager | Central | 19 | 22 | 46070446 | 1298416 | 1315916 | 7 |
| 1 | 8214 | Hafiz Muhammad Omer Zia | Relationship Manager | Central | 6 | 8 | 42595993 | 1193379 | 1208379 | 6 |
| 2 | 914 | Mohsin Arif | Sales Manager | North | 2 | 3 | 25178905 | 770428 | 785428 | 6 |
| 3 | 6895 | Agha Umaid Raza | Assistant Sales Manager | North | 2 | 3 | 25074550 | 769935 | 779935 | 4 |
| 4 | 5805 | Anum Tariq | Sales Manager | North | 2 | 5 | 22877214 | 694479 | 696979 | 1 |
| 5 | 6829 | Intikhab Ahmad | Sales Manager | Central | 11 | 13 | 26551881 | 666281 | 696281 | 12 |
| 6 | 1113 | Waqar Khan | Assistant Sales Manager | North | 0 | 4 | 16116093 | 557494 | 567494 | 4 |
| 7 | 6979 | Muhammad Usman | Assistant Sales Manager | North | 5 | 6 | 23449350 | 469199 | 501699 | 13 |
| 8 | 8121 | Waleed Bin Tariq | Relationship Manager | Central | 4 | 4 | 15834814 | 457605 | 467605 | 4 |
| 9 | 6467 | Amna Zameer | Relationship Manager | North | 2 | 3 | 14108569 | 428077 | 428077 | 0 |
Last rows
| | Emp Code | Emp Name | Designation | Region | New Customers | New/Existing Customers' | Inc. Gross Sales | Gross Load | Gross Load Adjusted | Digital Activation Count - CF |
|---|---|---|---|---|---|---|---|---|---|---|
| 623 | 8117 | Muhammad Arif Khan | Relationship Manager | South | 0 | 0 | 0 | 0 | 0 | 0 |
| 624 | 8252 | Mian Muhammad Yasir Saleem | Assistant Sales Manager | Central | 0 | 0 | 0 | 0 | 0 | 0 |
| 625 | 8268 | Jamal Uddin | Relationship Manager | South | 0 | 0 | 0 | 0 | 0 | 0 |
| 626 | 8288 | Muhammad Humair | Relationship Manager | South | 1 | 1 | 100000 | 0 | 0 | 0 |
| 627 | 8336 | Razia Nawab | Relationship Manager | North | 0 | 0 | 0 | 0 | 0 | 0 |
| 628 | 8353 | Muhammad Faizan Khurshid Abbasi | Relationship Manager | North | 0 | 0 | 0 | 0 | 0 | 0 |
| 629 | 8357 | Saad Bin Ghani | Relationship Manager | South | 0 | 0 | 0 | 0 | 0 | 0 |
| 630 | 8397 | Mehnaz Hameed | Relationship Manager | Multan | 1 | 1 | 6000000 | 0 | 0 | 0 |
| 631 | 8460 | Tahseen Khan | Assistant Sales Manager | South | 1 | 1 | 500000 | 0 | 0 | 0 |
| 632 | 772 | Muhammad Imran Qayyum | Sales Manager | Central | 0 | 3 | 779666 | -65366 | -52866 | 5 |